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DETAILED ACTION 

1 . This office action is in response to application 1 0/781 ,880 filed February 20, 
2004. Claims 1-16 are pending in the application and have been examined. 

Priority 

2. The application claims priority to Taiwan application 092132768 filed November 
21 , 2003. This priority date has been considered in this office action. 

Information Disclosure Statement 

3. The Information Disclosure Statement filed January 10, 2007 has been 
considered in this office action. 

Claim Rejections - 35 USC § 103 

4. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

5. The factual inquiries set forth in Graham v. John Deere Co., 383 U.S. 1 , 148 
USPQ 459 (1966), that are applied for establishing a background for determining 
obviousness under 35 U.S.C. 103(a) are summarized as follows: 

1 . Determining the scope and contents of the prior art. 

2. Ascertaining the differences between the prior art and the claims at issue. 

3. Resolving the level of ordinary skill in the pertinent art. 
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4. Considering objective evidence present in the application indicating 
obviousness or nonobviousness. 

6. Claims 1-9, and 12-16 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over the admitted prior art (APA) in view of Smith et al. (US PAP 
2001/0047263). 

7. Consider claim 1 , the APA teaches a voice interactive method comprising: 

a) performing voice recognition upon an input voice signal to detect presence of 

a predetermined keyword (a confirmation procedure is required in the Talk-to-Talk mode 
when issuing a voice command. In the Talk-to-Talk mode, the confirmation procedure 
involves the presence of a keyword in the issued voice command so as to minimize 
occurrence of unwanted responses; spec page 3, lines 8-12.); 

b) upon detecting that the input voice signal contains the predetermined keyword, 
performing semantic recognition upon the input voice signal (voice recognition is 
performed upon an input voice signal, and a responsive command is subsequently 
retrieved from a database based on the recognition result, spec page 2 lines 2-5. Talk to 
talk operates similarly as push to talk except a keyword is used; spec page 3, lines 7. 
Semantic analysis is inherent in the system of example of page 3 of talk-to-talk 
system.); 

c) generating a response according to result of the semantic recognition 
performed in step b) (voice recognition is performed upon an input voice signal, and a 
responsive command is subsequently retrieved from a database based on the 
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recognition result, spec page 2 lines 2-5. Talk to talk operates similarly as push to talk 
except a keyword is used; spec page 3, lines 7.). 
But the APA does not specifically teach: 

d) simultaneous with step b), calculating an idle time between a current input 
voice signal and a previous input voice signal; and 

e) disabling the semantic recognition of the input voice signal, and repeating step 
a) when the idle time calculated in step d) is larger than a predetermined threshold. 

In the same field of voice control systems, Smith teaches: 
d) simultaneous with step b), calculating an idle time between a current input 
voice signal and a previous input voice signal (Figure 6, ); and e) disabling the semantic 
recognition of the input voice signal, and repeating step a) when the idle time calculated 
in step d) is larger than a predetermined threshold (Typically, the wait state of step 620 
will last a predetermined amount of time, such as three seconds, and if no input is 
received (step 650), the system will display and ask the user verbally to input what type 
of command they wish to enter such as a command to call a specific name, phone 
number or to review a directory of names (step 655). Processing then returns to the 
command wait step 620. However/if no command is input by the user again within the 
predetermined amount of time (step 650), the system will go back to step 600 and await 
another attention word or key; paragraph 0060.). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the timing method of Smith with the voice control system of 
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the APA in order to provide a mechanism to disable the semantic analysis when no 
response is given in order to prevent unwanted responses. 

8. Consider claim 2, the APA teaches the voice interactive method as claimed in 
claim 1 , wherein step c) includes: generating a signal corresponding to the result of the 
semantic recognition performed in step b), and transmitting the signal to an electronic 
device such that the electronic device operates in response to the signal received 
thereby (In the example, it is assumed that the system keyword is "Jack", and the 
electronic device that incorporates the voice interactive system is a multi-media 
playback apparatus: User: Jack, activate the CD player. System: All right, I'll activate the 
CD player for you; spec page 3, lines 18-24.). 

9. Consider claim 3, the APA the voice interactive method as claimed in Claim 1 , 
wherein step c) includes generating an artificial voice response corresponding to the 
result of the semantic recognition performed in step b) (In the example, it is assumed 
that the system keyword is "Jack", and the electronic device that incorporates the voice 
interactive system is a multi-media playback apparatus: User: Jack, activate the CD 
player. System: All right, I'll activate the CD player for you; spec page 3, lines 18-24.). 

10. Consider claim 4, Smith teaches the voice interactive method as claimed in claim 
1 , wherein step c) includes generating an image that corresponds to the result of the 
semantic recognition performed in step b) (FIGS. 10a-10f show example screen 
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displays according to the present invention. FIG. 10a shows the basic screen display 
with the users selections to dial by name 100 or by number 200. The name list selection 
300 allows the user to view the directory of names, such as the directory shown in FIG. 
1 1 . After an attention word is entered into the system, icon 300 shown in FIG. 10b is 
displayed on the screen to indicate to the user that the system is on and waiting for a 
command. Throughout processing the telephone call, icon 300 is displayed whenever it 
is time for user input; paragraph 0070.). 

1 1 . Consider claim 5, the APA teaches a selective voice recognition method 
comprising: 

a) performing voice recognition upon an input voice signal to detect presence of 

a predetermined keyword (a confirmation procedure is required in the Talk-to-Talk mode 
when issuing a voice command. In the Talk-to-Talk mode, the confirmation procedure 
involves the presence of a keyword in the issued voice command so as to minimize 
occurrence of unwanted responses; spec page 3, lines 8-12.); 

b) upon detecting that the input voice signal contains the predetermined keyword, 
performing semantic recognition upon the input voice signal (voice recognition is 
performed upon an input voice signal, and a responsive command is subsequently 
retrieved from a database based on the recognition result, spec page 2 lines 2-5. Talk to 
talk operates similarly as push to talk except a keyword is used; spec page 3, lines 7. 
Semantic analysis is inherent in the system of example of page 3 of talk-to-talk 
system.); 
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c) generating a response according to result of the semantic recognition 
performed in step b) (voice recognition is performed upon an input voice signal, and a 
responsive command is subsequently retrieved from a database based on the 
recognition result, spec page 2 lines 2-5. Talk to talk operates similarly as push to talk 
except a keyword is used; spec page 3, lines 7.). 

But the APA does not specifically teach: 

d) simultaneous with step b), calculating an idle time between a current input 
voice signal and a previous input voice signal; and 

e) disabling the semantic recognition of the input ,voice signal, and repeating step 
a) when the idle time calculated in step d) is larger than a predetermined threshold. 

In the same field of voice control systems, Smith teaches: 
d) simultaneous with step b), calculating an idle time between a current input 
voice signal and a previous input voice signal; and e) disabling the semantic recognition 
of the input voice signal, and repeating step a) when the idle time calculated in step d) is 
larger than a predetermined threshold (Typically, the wait state of step 620 will last a 
predetermined amount of time, such as three seconds, and if no input is received (step 
650), the system will display and ask the user verbally to input what type of command 
they wish to enter such as a command to call a specific name, phone number or to 
review a directory of names (step 655). Processing then returns to the command wait 
step 620. However, if no command is input by the user again within the predetermined 
amount of time (step 650), the system will go back to step 600 and await another 
attention word or key; paragraph 0060 ). 
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Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the timing method of Smith with the voice control system of 
the APA in order to provide a mechanism to disable the semantic analysis when no 
response is given in order to prevent unwanted responses. 

12. Consider claim 6, the APA teaches a voice interactive system (a system is 
inherent in the APA in order to implement features below) comprising: 

a detecting module adapted for performing voice recognition upon an input voice 
signal to detect presence of a predetermined keyword (a confirmation procedure is 
required in the Talk-to-Talk mode when issuing a voice command. In the Talk-to-Talk 
mode, the confirmation procedure involves the presence of a keyword in the issued 
voice command so as to minimize occurrence of unwanted responses; spec page 3, 
lines 8-12.); 

a semantic recognition module coupled to and controlled by said detecting 
module so as to switch operation from a disabled mode to an enabled mode, where said 
semantic recognition module performs semantic recognition upon the input voice signal, 
when the presence of the predetermined keyword in the input voice signal is detected 
by said detecting module (voice recognition is performed upon an input voice signal, 
and a responsive command is subsequently retrieved from a database based on the 
recognition result, spec page 2 lines 2-5. Talk to talk operates similarly as push to talk 
except a keyword is used; spec page 3, lines 7. Semantic analysis is inherent in the 
system of example of page 3 of talk-to-talk system.); 
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a response module coupled to and controlled by said semantic recognition 
module so as to generate a response according to result of the semantic recognition 
performed by said semantic recognition module (voice recognition is performed upon an 
input voice signal, and a responsive command is subsequently retrieved from a 
database based on the recognition result, spec page 2 lines 2-5. Talk to talk operates 
similarly as push to talk except a keyword is used; spec page 3, lines 7.). 

But the APA does not specifically teach: 

a timer module which operates simultaneously with operation of said semantic 
recognition module in the enabled mode so as to calculate an idle time between a 
current input voice signal and a previous input voice signal, and so as to determine 
whether the idle time calculated thereby is larger than a predetermined threshold; and 

a mode switching module coupled to said timer module and said detecting 
module, said mode switching module enabling said detecting module to switch 
operation of said semantic recognition module from the enabled mode back to the 
disabled mode upon detection by said timer module that the idle time between the 
current input voice signal and the previous input voice signal is larger than the 
predeterm ined threshold . 

In the same field of voice control systems, Smith teaches: 

a timer module which operates simultaneously with operation of said semantic 
recognition module in the enabled mode so as to calculate an idle time between a 
current input voice signal and a previous input voice signal, and so as to determine 
whether the idle time calculated thereby is larger than a predetermined threshold 
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(Typically, the wait state of step 620 will last a predetermined amount of time, such as 
three seconds, and if no input is received (step 650), the system will display and ask the 
user verbally to input what type of command they wish to enter such as a command to 
call a specific name, phone number or to review a directory of names (step 655). 
Processing then returns to the command wait step 620. However, if no command is 
input by the user again within the predetermined amount of time (step 650), the system 
will go back to step 600 and await another attention word or key; paragraph 0060.)); and 

a mode switching module coupled to said timer module and said detecting 
module, said mode switching module enabling said detecting module to switch 
operation of said semantic recognition module from the enabled mode back to the 
disabled mode upon detection by said timer module that the idle time between the 
current input voice signal and the previous input voice signal is larger than the 
predetermined threshold Typically, the wait state of step 620 will last a predetermined 
amount of time, such as three seconds, and if no input is received (step 650), the 
system will display and ask the user verbally to input what type of command they wish 
to enter such as a command to call a specific name, phone number or to review a 
directory of names (step 655). Processing then returns to the command wait step 620. 
However, if no command is input by the user again within the predetermined amount of 
time (step 650), the system will go back to step 600 and await another attention word or 
key; paragraph 0060.). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the timing method of Smith with the voice control system of 
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the APA in order to provide a mechanism to disable the semantic analysis when no 
response is given in order to prevent unwanted responses. 

1 3. Consider claim 7, the APA teaches the voice interactive system as claimed in 
claim 6, wherein said response module includes an operation control module for 
generating a signal corresponding to the result of the semantic recognition performed by 
said semantic recognition module, said operation control module being adapted to 
transmit the signal generated thereby to an electronic device such that the electronic 
device operates in response to the signal (In the example, it is assumed that the system 
keyword is "Jack", and the electronic device that incorporates the voice interactive 
system is a multi-media playback apparatus: User: Jack, activate the CD player. 
System: All right, I'll activate the CD player for you; spec page 3, lines 18-24.). 

14. Consider claim 8, the APA teaches the voice interactive system as claimed in 
claim 6, wherein said response module includes a voice response module for providing ' 
artificial voice response data corresponding to the result of the semantic recognition 
performed by said semantic recognition module (In the example, it is assumed that the 
system keyword is "Jack", and the electronic device that incorporates the voice 
interactive system is a multi-media playback apparatus: User: Jack, activate the CD 
player. System: All right, I'll activate the CD player for you; spec page 3, lines 18-24.). 



Application/Control Number: 10/781,880 Page 12 

Art Unit: 2626 

15. Consider claim 9, Smith teaches the voice interactive system as claimed in Claim 
6, wherein said response module includes an image response module for providing 
image data that corresponds to the result of the semantic recognition performed by said 
semantic recognition module (FIGS. 10a-10f show example screen displays according 
to the present invention. FIG. 10a shows the basic screen display with the users 
selections to dial by name 100 or by number 200. The name list selection 300 allows 
the user to view the directory of names, such as the directory shown in FIG. 11. After an 
attention word is entered into the system, icon 300 shown in FIG. 10b is displayed on 
the screen to indicate to the user that the system is on and waiting for a command. 
Throughout processing the telephone call, icon 300 is displayed whenever it is time for 
user input; paragraph 0070.). 

16. Consider claim 12, the APA teaches a selective voice recognition system (a 
system is inherent in the APA in order to implement features below) comprising: 

a detecting module adapted for performing voice recognition upon an input voice 
signal to detect presence of a predetermined keyword (a confirmation procedure is 
required in the Talk-to-Talk mode when issuing a voice command. In the Talk-to-Talk 
mode, the confirmation procedure involves the presence of a keyword in the issued 
voice command so as to minimize occurrence of unwanted responses; spec page 3, 
lines 8-12.); 

a semantic recognition module coupled to and controlled by said detecting 
module so as to switch operation from a disabled mode to an enabled mode, where said 
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semantic recognition module performs semantic recognition upon the input voice signal, 
when the presence of the predetermined keyword in the input voice signal is detected 
by said detecting module (voice recognition is performed upon an input voice signal, 
and a responsive command is subsequently retrieved from a database based on the 
recognition result, spec page 2 lines 2-5. Talk to talk operates similarly as push to talk 
except a keyword is used; spec page 3, lines 7. Semantic analysis is inherent in the 
system of example of page 3 of talk-to-talk system.); 

a response module coupled to and controlled by said semantic recognition 
module so as to generate a response according to result of the semantic recognition 
performed by said semantic recognition module (voice recognition is performed upon an 
input voice signal, and a responsive command is subsequently retrieved from a 
database based on the recognition result, spec page 2 lines 2-5. Talk to talk operates 
similarly as push to talk except a keyword is used; spec page 3, lines 7.). 

But the APA does not specifically teach: 

a timer module which operates simultaneously with operation of said semantic 
recognition module in the enabled mode so as to calculate an idle time between a 
current input voice signal and a previous input voice signal, and so as to determine 
whether the idle time calculated thereby is larger than a predetermined threshold; and 

a mode switching module coupled to said timer module and said detecting 
module, said mode switching module enabling said detecting module to switch 
operation of said semantic recognition module from the enabled mode back to the 
disabled mode upon detection by said timer module that the idle time between the 
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current input voice signal and the previous input voice signal is larger than the 
predetermined threshold. 

In the same field of voice control systems, Smith teaches: 
a timer module which operates simultaneously with operation of said semantic 
recognition module in the enabled mode so as to calculate an idle time between a 
current input voice signal and a previous input voice signal, and so as to determine 
whether the idle time calculated thereby is larger than a predetermined threshold 
(Typically, the wait state of step 620 will last a predetermined amount of time, such as 
three seconds, and if no input is received (step 650), the system will display and ask the 
user verbally to input what type of command they wish to enter such as a command to 
call a specific name, phone number or to review a directory of names (step 655). 
Processing then returns to the command wait step 620. However, if no command is 
input by the user again within the predetermined amount of time (step 650), the system 
will go back to step 600 and await another attention word or key; paragraph 0060.)); and 

a mode switching module coupled to said timer module and said detecting 
module, said mode switching module enabling said detecting module to switch 
operation of said semantic recognition module from the enabled mode back to the 
disabled mode upon detection by said timer module that the idle time between the 
current input voice signal and the previous input voice signal is larger than the 
predetermined threshold Typically, the wait state of step 620 will last a predetermined 
amount of time, such as three seconds, and if no input is received (step 650), the 
system will display and ask the user verbally to input what type of command they wish 
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to enter such as a command to call a specific name, phone number or to review a 
directory of names (step 655). Processing then returns to the command wait step 620. 
However, if no command is input by the user again within the predetermined amount of 
time (step 650), the system will go back to step 600 and await another attention word or 
key; paragraph 0060.). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the timing method of Smith with the voice control system of 
the APA in order to provide a mechanism to disable the semantic analysis when no 
response is given in order to prevent unwanted responses. 

17. Consider claim 1 3, the APA teaches a an electronic device (a device is inherent 
in the APA in order to implement features below) comprising: 

a detecting module adapted for performing voice recognition upon an input voice 
signal to detect presence of a predetermined keyword (a confirmation procedure is 
required in the Talk-to-Talk mode when issuing a voice command. In the Talk-to-Talk 
mode, the confirmation procedure involves the presence of a keyword in the issued 
voice command so as to minimize occurrence of unwanted responses; spec page 3, 
lines 8-12.); 

a semantic recognition module coupled to and controlled by said detecting 
module so as to switch operation from a disabled mode to an enabled mode, where said 
semantic recognition module performs semantic recognition upon the input voice signal, 
when the presence of the predetermined keyword in the input voice signal is detected 
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by said detecting module (voice recognition is performed upon an input voice signal, 
and a responsive command is subsequently retrieved from a database based on the 
recognition result, spec page 2 lines 2-5. Talk to talk operates similarly as push to talk 
except a keyword is used; spec page 3, lines 7. Semantic analysis is inherent in the 
system of example of page 3 of talk-to-talk system.); 

a response module coupled to and controlled by said semantic recognition 
module so as to generate a response according to result of the semantic recognition 
performed by said semantic recognition module (voice recognition is performed upon an 
input voice signal, and a responsive command is subsequently retrieved from a 
database based on the recognition result, spec page 2 lines 2-5. Talk to talk operates 
similarly as push to talk except a keyword is used; spec page 3, lines 7.). 

But the APA does not specifically teach: 

a sound pickup module adapted for receiving an input voice signal (although it is 
practically inherent); 

a timer module which operates simultaneously with operation of said semantic 
recognition module in the enabled mode so as to calculate an idle time between a 
current input voice signal and a previous input voice signal, and so as to determine 
whether the idle time calculated thereby is larger than a predetermined threshold; and 

a mode switching module coupled to said timer module and said detecting 
module, said mode switching module enabling said detecting module to switch 
operation of said semantic recognition module from the enabled mode back to the 
disabled mode upon detection by said timer module that the idle time between the 



Application/Control Number: 1 0/781 ,880 Page 1 7 

Art Unit: 2626 

current input voice signal and the previous input voice signal is larger than the 

predetermined threshold. 

In the same field of voice control systems, Smith teaches: 

a sound pickup module adapted for receiving an input voice signal (figure 3, 

microphone 3274.) 

a timer module which operates simultaneously with operation of said semantic 
recognition module in the enabled mode so as to calculate an idle time between a 
current input voice signal and a previous input voice signal, and so as to determine 
whether the idle time calculated thereby is larger than a predetermined threshold 
(Typically, the wait state of step 620 will last a predetermined amount of time, such as 
three seconds, and if no input is received (step 650), the system will display and ask the 
user verbally to input what type of command they wish to enter such as a command to 
call a specific name, phone number or to review a directory of names (step 655). 
Processing then returns to the command wait step 620. However, if no command is 
input by the user again within the predetermined amount of time (step 650), the system 
will go back to step 600 and await another attention word or key; paragraph 0060.)); and 

a mode switching module coupled to said timer module and said detecting 
module, said mode switching module enabling said detecting module to switch 
operation of said semantic recognition module from the enabled mode back to the 
disabled mode upon detection by said timer module that the idle time between the 
current input voice signal and the previous input voice signal is larger than the 
predetermined threshold Typically, the wait state of step 620 will last a predetermined 
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amount of time, such as three seconds, and if no input is received (step 650), the 
system will display and ask the user verbally to input what type of command they wish 
to enter such as a command to call a specific name, phone number or to review a 
directory of names (step 655). Processing then returns to the command wait step 620. 
However, if no command is input by the user again within the predetermined amount of 
time (step 650), the system will go back to step 600 and await another attention word or 
key; paragraph 0060.). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the timing method of Smith with the voice control system of 
the APA in order to provide a mechanism to disable the semantic analysis when no 
response is given in order to prevent unwanted responses. 

18. Consider claim 14, the APA teaches the voice interactive system as claimed in 
claim 13, wherein said response module includes an operation control module for 
generating a signal corresponding to the result of the semantic recognition performed by 
said semantic recognition module, said operation control module being adapted to 
transmit the signal generated thereby to an electronic device such that the electronic 
device operates in response to the signal (In the example, it is assumed that the system 
keyword is "Jack", and the electronic device that incorporates the voice interactive 
system is a multi-media playback apparatus: User: Jack, activate the CD player. 
System: All right, I'll activate the CD player for you; spec page 3, lines 18-24 ). 
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19. Consider claim 15, the APA teaches the voice interactive system as claimed in 
claim 13, wherein said response module includes a voice response module for providing 
artificial voice response data corresponding to the result of the semantic recognition 
performed by said semantic recognition module (In the example, it is assumed that the 
system keyword is "Jack", and the electronic device that incorporates the voice 
interactive system is a multi-media playback apparatus: User: Jack, activate the CD 
player. System: All right, I'll activate the CD player for you; spec page 3, lines 18-24.). 

20. Consider claim 16, Smith teaches the voice interactive system as claimed in 
claim 13, wherein said response module includes an image response module for 
providing image data that corresponds to the result of the semantic recognition 
performed by said semantic recognition module (FIGS. 10a-10f show example screen 
displays according to the present invention. FIG. 10a shows the basic screen display 
with the users selections to dial by name 100 or by number 200. The name list selection 
300 allows the user to view the directory of names, such as the directory shown in FIG. 
1 1 . After an attention word is entered into the system, icon 300 shown in FIG. 10b is 
displayed on the screen to indicate to the user that the system is on and waiting for a 
command. Throughout processing the telephone call, icon 300 is displayed whenever it 
is time for user input; paragraph 0070.). 
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21 . Claims 10 and 1 1 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over the APA in view of Smith as applied to claim 6 above, and further in view of 
Bellegarda et al (US Patent 6,285,785). 

22. Consider claim 10, APA and smith teaches the voice interactive system as 
claimed in claim 6, but does not specifically teach wherein said detecting module 
includes: 

a feature parameter retrieving unit for retrieving feature parameters of the input 
voice signal; 

a voice model building unit coupled to said feature parameter retrieving unit for 

building voice models with reference to the feature parameters retrieved by said feature 

parameter retrieving unit; 

a keyword voice modeling unit for storage of keyword voice models; and 

a voice model comparing unit coupled to said voice model building unit and said 

keyword voice modeling unit for comparing similarity among built voice models and the 

keyword voice models. 

In the same field of speech recognition, Bellegarda teaches: 

a feature parameter retrieving unit for retrieving feature parameters of the input 

voice signal (With the use of a microphone and A/D converter as the speech transducer 

22 the speech is typically sampled at a 20 kHz rate and is Fast Fourier transformed; 

column 6, line 46.); 
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a voice model building unit coupled to said feature parameter retrieving unit for 
building voice models with reference to the feature parameters retrieved by said feature 
parameter retrieving unit (The role of the Markov word model for speech is to represent 
the pronunciation of a word. The parameters of the Markov model are trained by relating 
the acoustic signal produced by the speaker to the word uttered; column 7, line 50.); 

a keyword voice modeling unit for storage of keyword voice models (inherent if 
combined with APA and smith. In order to apply Markov models to the keyword, a 
modeling unit is inherent.); and 

a voice model comparing unit coupled to said voice model building unit and said 
keyword voice modeling unit for comparing similarity among built voice models and the 
keyword voice models (For speech, the likelihood of each Markov word model 
producing the input utterance is computed, after pruning by the 3-gram LM 38a, and 
those words resulting in the highest scores are placed on the list of word candidates 
(L.sub.1). In both cases (speech and handwriting), the size of the list is preferably 
statically set using a predetermined reasonable maximum size, such as 15, or may be 
set dynamically using a threshold. In the latter case, all words whose likelihoods fall 
within the selected threshold are maintained on the list, and the other words discarded; 
collumn8, lines 48-58.). 

Therefore it would have been obvious to one of ordinary skill in the art at the time 
of the invention to combine the modeling system of Bellegarda with the voice 
recognition system in order to provide a robust method to enable word recognition. 
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23. Consider claim 1 1 , Bellegarda teaches the voice interactive system as claimed in 
Claim 10, wherein said semantic recognition module includes a database containing a 
plurality of voice model samples, and a voice model comparing unit coupled to said 
detecting unit similarity among said database for comparing the built voice models and 
the voice model samples (For speech, the likelihood of each Markov word model 
producing the input utterance is computed, after pruning by the 3-gram LM 38a, and 
those words resulting in the highest scores are placed on the list of word candidates 
(L.sub.1). In both cases (speech and handwriting), the size of the list is preferably 
statically set using a predetermined reasonable maximum size, such as 15, or may be 
set dynamically using a threshold. In the latter case, all words whose likelihoods fall 
within the selected threshold are maintained on the list, and the other words discarded; 
collumn8, lines 48-58. In order to do this, a database must used in order to provide the 
models to prune from.). 

Conclusion 

The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure is listed on the Notice of References Cited. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Douglas C. Godbold whose telephone number is (571) 
270-1451 . The examiner can normally be reached on Monday-Thursday 7:00am- 
4:30pm Friday 7:00am-3:30pm. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Patrick Edouard can be reached on (571) 272-7603. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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